klotz: logs* + production engineering*

  1. This article discusses how traditional machine learning methods, particularly outlier detection, can be used to improve the precision and efficiency of Retrieval-Augmented Generation (RAG) systems by filtering out irrelevant queries before document retrieval.
  2. klogg is an open-source multi-platform GUI application for searching through text log files using regular expressions. It offers various features like handling large files, fast searching, and color-coded results.
  3. OpenTelemetry is not just an observability platform; it is a set of best practices and standards that can be integrated into platform engineering or DevOps.
  4. OpenLogParser, an unsupervised log parsing approach using open-source LLMs, improves accuracy, privacy, and cost-efficiency in large-scale data processing.

    Approach:
    - Log grouping: Clusters logs based on shared syntactic features.
    - Unsupervised LLM-based parsing: Uses a retrieval-augmented approach to separate static and dynamic components.
    - Log template memory: Stores parsed templates for future use, minimizing LLM queries.

    Results:
    - Processes logs 2.7 times faster than other LLM-based parsers.
    - Improves average parsing accuracy by 25% over existing parsers.
    - Handles over 50 million logs from the LogHub-2.0 dataset.
    - Achieves high grouping accuracy (87.2%) and parsing accuracy (85.4%).
    - Outperforms other state-of-the-art parsers like LILAC and LLMParserT5Base in processing speed and accuracy.
  5. Linux log management can be a tricky process. This article guides you through best practices for managing logs on Linux systems.
    2024-08-04, by klotz
  6. lnav is a log file viewer for large plain-text files. It can handle files of any size and offers features like search, filtering, and regex highlighting. It is written in C++ and runs on Linux, macOS, and other Unix-like systems.
    2024-06-21, by klotz
  7. Hydrolix is a streaming data lake platform purpose-built for immutable log data, meaning data that arrives once and never changes, at a lower cost than traditional solutions. It is particularly well suited to observability data and offers real-time query performance on terabyte-scale data. Hydrolix exposes an ANSI-compliant SQL interface, is schema-based and fully indexed, and is designed for high-cardinality data. It is currently used by companies in industries such as media, gaming, ad tech, and telecom security that require long-term data retention. The company recently announced a $35 million Series B round, and its technology is the basis for Akamai's observability product TrafficPeak. For companies handling billions of transactions a day and terabytes of data, the platform can either cut storage costs or retain data for longer than traditional solutions like Splunk or Datadog.
  8. With the addition of profiling to OpenTelemetry, we expect continuous production profiling to hit the mainstream.
  9. This article explains the differences between observability, telemetry, and monitoring, and how they work together to help teams understand and improve their software systems. It also discusses the benefits of using OpenTelemetry, a standard for creating and collecting telemetry for software systems, and Honeycomb's observability platform.
  10. OpenTelemetry offers a standardized process for observability, but its functionality is a work in progress. Its usefulness depends on the observability tools and platforms used in conjunction with OpenTelemetry.
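
The log grouping and template memory steps summarized in item 4 can be illustrated with a minimal sketch. This is not OpenLogParser's code. The grouping key (token count plus first token) is an assumption, and `stub_parse` is a hypothetical stand-in for the LLM call that simply masks any token containing a digit. The point is only to show how a template memory avoids repeated parser queries.

```python
import re
from collections import defaultdict


def group_logs(lines):
    """Cluster lines by simple syntactic features (assumed: token count, first token)."""
    groups = defaultdict(list)
    for line in lines:
        tokens = line.split()
        key = (len(tokens), tokens[0] if tokens else "")
        groups[key].append(line)
    return dict(groups)


def stub_parse(line):
    """Hypothetical stand-in for the LLM: mask every token that contains a digit."""
    return " ".join(
        "<*>" if any(c.isdigit() for c in tok) else tok for tok in line.split()
    )


def template_to_regex(template):
    """Turn 'static <*> static' into a regex where <*> matches one token."""
    parts = [re.escape(part) for part in template.split("<*>")]
    return re.compile(r"\S+".join(parts))


class TemplateMemory:
    """Stores parsed templates so the parser is only queried for unseen shapes."""

    def __init__(self, parse_fn):
        self.parse_fn = parse_fn
        self.templates = []  # list of (template, compiled regex)
        self.calls = 0       # number of times parse_fn was actually invoked

    def parse(self, line):
        for template, pattern in self.templates:
            if pattern.fullmatch(line):
                return template  # memory hit: no parser query needed
        self.calls += 1
        template = self.parse_fn(line)
        self.templates.append((template, template_to_regex(template)))
        return template


if __name__ == "__main__":
    logs = [
        "Connected to 10.0.0.1 port 22",
        "Connected to 10.0.0.2 port 8080",
        "Disk usage at 91%",
    ]
    memory = TemplateMemory(stub_parse)
    for group in group_logs(logs).values():
        for line in group:
            print(memory.parse(line))
    print("parser calls:", memory.calls)
```

In this sketch the second "Connected to" line is served from the stored template, so the parser stand-in is invoked twice for three lines.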
